Mathematical and Computational Linguistics Project N.4 Syntactic Phylogenetic Trees
نویسنده
چکیده
1. Linguistic Phylogenetic Trees The reconstruction of phylogenetic trees of language families is one of the main problems in Historical Linguistics. In recent years, computational methods have been used, mostly borrowed from similar techniques in mathematical biology, see for instance the collection of papers in [1]. Mostly, the computational reconstructions of linguistic phylogenetic trees followed the original method developed in traditional Historical Linguistics, namely using lexical databases and cognate words. However, recently it was shown in [3] that syntactic parameters can also be efficiently used as data on the basis of which to construct phylogenetic trees. A general introduction to mathematical, computational and statistical methods for the construction of phylogenetic trees and graphs can be found in [2].
منابع مشابه
Mathematical and Computational Linguistics Project N.3 Syntactic Parameters as a Spin Glass Model
The project consists of running simulations of language evolution as a spin-glass model, on a graph with vertices a group of languages and edges representing the interaction between them. The syntactic parameters are viewed as spin variables at the vertices. The current data of syntactic parameters of various languages provide the initial configuration. An extensive database of syntactic parame...
متن کاملMultiway-Tree Retrieval Based on Treegrams
Large tree databases as knowledge repositories become more and more important; a prominent example are the treebanks in computational linguistics: text corpora consisting of up to five million words tagged with syntactic information. Consequently, these large amounts of structured data pose the problem of fast tree retrieval: Given a database T of labeled multiway trees and a query tree q, find...
متن کاملSyntactic Phylogenetic Trees
In light of recent controversies surrounding the use of computational methods for the reconstruction of phylogenetic trees of language families (especially the Indo-European family), a possible approach based on syntactic information, complementing other linguistic methods, appeared as a promising possibility, largely developed in recent years in Longobardi’s Parametric Comparison Method. In th...
متن کاملThe Treegram Index|an Eecient Technique for Retrieval in Linguistic Treebanks under Consideration for Other Conferences (specify)? Acl
In computational linguistics, large tree databases tagged with morpho-syntactic information are in need of fast retrieval of multiway tree structures. To tackle this problem, we present a generalization of the classical n-gram indexing technique called Treegram indexing. As an application of treegram indexing, we describe the Venona retrieval system, which handles the BH t treebank containing 5...
متن کاملRestricted Non-Projectivity: Coverage vs. Efficiency
In the last decade, various restricted classes of non-projective dependency trees have been proposed with the goal of achieving a good tradeoff between parsing efficiency and coverage of the syntactic structures found in natural languages. We perform an extensive study measuring the coverage of a wide range of such classes on corpora of 30 languages under two different syntactic annotation crit...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2015